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Abstract. In this paper we study sum-free sets of order m in finite Abelian groups. We 
prove a general theorem on 3-uniform hypergraphs, which allows us to deduce structural 
results in the sparse setting from stability results in the dense setting. As a consequence, 
we determine the typical structure and asymptotic number of sum-free sets of order m 
in Abelian groups G whose order is divisible by a prime q with q = 2 (mod 3), for every 
m C(q)\/n\ogn 1 thus extending and refining a theorem of Green and Ruzsa. In particular, 
we prove that almost all sum- free subsets of size m are contained in a maximum-size sum- free 
subset of G. We also give a completely self-contained proof of this statement for Abelian 
groups of even order, which uses spectral methods and a new bound on the number of 
independent sets of size m in an (n, d, A)-graph. 



1. Introduction 

An important trend in Combinatorics in recent years has been the formulation and proof 
of various 'sparse analogues' of classical extremal results in Graph Theory and Additive 
Combinatorics. Due to the recent breakthroughs of Conlon and Gowers [10] and Schacht [34J , 
many such theorems, e.g., the theorems of Turan [37J and Erdos and Stone [14J in extremal 
graph theory, and the theorem of Szemeredi [36] on arithmetic progressions, are now known 
to extend to sparse random sets. For structural and enumerative results, such as the theorem 
of Kolaitis, Promel and Rotshchild [22] which states that almost all i^ r+ i-free graphs are 
r-colorable, perhaps the most natural sparse analogue is a corresponding statement about 
subsets of a given fixed size m, whenever m is not too small. In this paper, we prove such a 
result in the context of sum-free subsets of Abelian groups and provide a general framework 
for solving problems of this type. To be precise, we obtain a sparse analogue of a result 
of Green and Ruzsa [18], which describes the structure of a typical sum- free subset of an 
Abelian group. 

Sparse versions of classical extremal and Ramsey-type results were first proved for graphs 
by Babai, Simonovits and Spencer [5], and for additive structures by Kohayakawa, Luczak 
and Rodl [21] , and in recent years there has been a tremendous interest in such problems (see, 
e.g, [121 EDI EI] ) • Mostly, these results have been in the random setting; for example, Graham, 
Rodl and Rucihski [T7] showed that if p ^> 1/ y/n, and B C Z n is a p-random subset, then 
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1 A p-random subset of a set A is a random subset of X, where each element is included with probability 
p, independently of all other elements. 
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with high probability every 2-colouring of B contains a monochromatic solution of x + y = z. 
The extremal version of this question was open for fifteen years, until it was recently resolved 
by Conlon and Gowers [10] and Schacht [34] . 

For problems of the type we are considering, results are known only in a few special cases. 
Most notably, Osthus, Promel and Taraz [27], confirming (and strengthening) a conjecture 
of Promel and Steger [29], proved that if m ^ + e)n 3//2 i/log n then almost all triangle- 
free graphs with m edges are bipartite; moreover, the constant a/3/4 is best possible. This 
result can be seen as a sparse version of the classical theorem of Erdos, Kleitman and 
Rothschild [12], which states that almost all triangle-free graphs are bipartite. A similarly 
sharp result was proved by Friedgut, Rodl, Rucihski and Tetali [13] for the existence of 
monochromatic triangles in two-colourings of G n>p . It is an interesting open problem to 
prove such a sharp threshold in the setting of Theorem 11.1] below. 

A set A C G, where G is an Abelian group, is said to be sum-free if (A + A) D A — 0, 
or equivalently, if there is no solution to the equation x + y = z with x,y,z G A. Sum-free 
subsets of Abelian groups are central objects of interest in Additive Combinatorics, and have 
been studied intensively in recent years. The main questions are as follows: What are the 
largest sum-free subsets of G7 How many sum-free sets are there? And what does a typical 
such set look like? Over forty years ago, Diananda and Yap [UJ determined the maximum 
density fi(G) of a sum-free set in G whenever |G| has a prime factor q ^ 1 (mod 3), but it 
was not until 2005 that Green and Ruzsa [18] completely solved this extremal question for all 
finite Abelian groups. On the second and third questions, Lev, Luczak and Schoen [26J and 
Sapozhenko [33J determined the asymptotic number of sum-free subsets in an Abelian group 
of even order by showing that almost all such sets^ lie in the complement of a subgroup 
of index 2. Green and Ruzsa [18] extended this result to Abelian groups which have a 
prime factor q = 2 (mod 3), and showed also that a general finite Abelian group G has 
2 (i+ (i)MG)|G| sum -f r ee subsets. 

We say that G is of Type I if \G\ has a prime divisor q with q = 2 (mod 3), and Type 1(g) 
if q is the smallest such prime. Diananda and Yap [TT] proved that if G is of Type 1(g), 
then fi(G) — (q + l)/(3g); moreover, they described all sum-free subsets of G with fJ,(G) \G\ 
elements. Green and Ruzsa [TS] determined the asymptotic number of sum-free subsets of 
an Abelian group G of Type I, by showing that almost every sum-free set in G is contained 
in some sum-free set of maximum size. Balogh, Morris and Samotij [B] studied p-random 
subsets of such groups, and showed that if p ^ C(q)^/\ogn/n, G is an Abelian group of 
Type 1(g) and order n, and G p is a p-random subset of G, then with high probability every 
maximum-size sum-free subset of G p is contained in some sum-free subset of G of maximum 
size. In the case G = Z 2n , they showed that if p ^ +e) \/ n log n and A C G is a p-random 
subset, then with high probability the unique largest sum- free subset of is A D 02 n , where 
02n Q ^2n denotes the set of odd residues modulo 2n. Moreover, the constant 1 / \/3 in this 
result is best possible. 



We say that almost all sets in a family T of subsets of G satisfy some property V if the ratio of the 
number of sets in T that have V to the number of all sets in T tends to 1 as \G\ tends to infinity. 
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Let us denote by SF(G,m) the collection of sum-free subsets of size m in a finite Abelian 
group G. The following theorem refines the result of Green and Ruzsa [TS] to sum-free sets 
of fixed size m, provided that m ^ C(q)\fn logn. 

Theorem 1.1. For every prime q = 2 (mod 3), there exists a constant C(q) > such 
that the following holds. Let G be an Abelian group of Type I(q) and order n, and let 
m ^ C(q)y/n logn. Then almost every sum-free subset of G of size m is contained in a 
maximum-size sum-free subset of G, and hence 

|SF(G,m)| = X q ■ (^{elements of G of order g} + o(l)) 

as n — )■ oo, where X q = 1 if q = 2 and X q = 1/2 otherwise. 

Although the factor X q ■ ^{elements of Gof order q} above may appear mysterious, it is 
a natural consequence of the characterization of maximum-size sum-free sets in groups of 
Type I, see Theorem 16.21 We remark that the lower bound m ^ C(q)^n logn is sharp up to 

a constant factor, since there are at least (n/2)(/z(G)n — 3m) m /(m — 1)! sum-free subsets 
of G which contain exactly one element outside a given maximum-size sum-free subset of G, 
and this is larger than m 1//5 ( M< ^ n ) if m ^ \^J n logn. (Here, and throughout, log denotes 
the natural logarithm.) Hence, assuming m 1//5 is much larger than the number of elements 
of order q in G, almost no sum-free subset of G of this size is contained in a maximum-size 
sum-free subset of G. 

We shall prove Theorem 11.11 using a new theorem (see Section [2]) which describes the 
structure of a typical independent set in a 3-uniform hypergraph "H that satisfies a certain 
natural 'stability' property, see Definition ^. II The key ingredient in the proof of this theorem 
is a new method of enumerating independent sets in 3-uniform hypergraphs. We shall also 
use a simplified version of this method to prove a new bound on the number of independent 
sets in a certain class of expander graphs known as (n, d, A)-graphs. 

First, let us recall the definition of (n, d, A)-graphs, which are an important class of ex- 
panders; for a detailed introduction to expander graphs, we refer the reader to [4J or [19J. 
Given a graph Q, let Ai ^ . . . ^ A n denote the eigenvalues of the adjacency matrix of Q. We 
call max{|A2|, |A„|} the second eigenvalue of Q. 

Definition 1.2 ((n, d, A)-graphs). A graph Q is an (n, d, A)-graph if it is <i-regular, has n 
vertices, and the absolute value of each of its nontrivial eigenvalues is at most A. 

Alon and Rodl [3] gave an upper bound on the number of independent sets in an (n, d, X)- 
graph, and used their result to give sharp bounds on multicolour Ramsey numbers. When 
A = Q(d) (as n — > oo), Theorem 11.31 below provides a significantly stronger bound than that 
of Alon and Rodl, for a wider range of m; it is moreover asymptotically sharp. In fact, we will 
not assume anything about the second eigenvalue of a graph Q as our bound on the number 
of independent sets of Q will depend only on the smallest eigenvalue of Q. Given a graph 
Q, let A((/) be the smallest eigenvalue of the adjacency matrix of Q (denoted by A n above) 
and let I(Q,m) be the number of independent sets of size m in Q. Observe that A((?) < 
for every non-empty Q and that, by definition, every (n, d, A)-graph satisfies A(£) ^ —A. 
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Theorem 1.3. For every e > 0, there exists a constant C = C(e) such that the following 
holds. If Q is an n-vertex d-regular graph with X(G) ^ —A, then 

I(g,m) <:(^ + £ ^ n ) 
\ m J 

for every m ^ Cn/d. 

We remark that the constant in Theorem 11.31 is best possible, since there exist n- 
vertex (i-regular graphs with \{Q) #5 —A and oi{Q) = -^^n for many values of n, d and A 
(here ct(Q) denotes the independence number of Q). For example, consider a blow-up of the 
complete graph K t +±, where each vertex is replaced by a set of size n/(t + 1) and each edge 
is replaced by a random d/i-regular bipartite graph with colour classes of size n/(t + 1) each. 
This blown- up graph Q is d- regular and, with high probability, it satisfies = —d/t and 
a{Q) = n/{t + l). 

In Section [7J we shall use Theorem 11.31 together with some basic facts about characters 
of finite Abelian groups, to give a completely self-contained proof of Theorem 11.11 in the 
case q = 2. For previous results relating the problem of estimating the number of sum- 
free subsets of groups, to that of estimating the number of independent sets in regular 
graphs, see for example [Tj, [261 [33] . For other results on counting independent sets in graphs 
and hypergraphs, see Balogh and Samotij [TJ [8], Carroll, Galvin and Tetali [9], Galvin and 
Kahn [16], Kahn [2TJ[22], Peled and Samotij [28], Sapozhenko [32] and Zhao [38] . 

The rest of the paper is organised as follows. In Section [2], we state our structural theorem 
for 3-uniform hypergraphs, and in Section [3] we prove Theorem 11.31 In Sections H] and 
we prove the structural theorem, and in Sections E] we shall apply it to prove Theorem 11.11 
Finally, in Section [TJ we shall prove Theorem 11.11 again in the case q = 2. 

2. A STRUCTURAL THEOREM FOR 3-UNIFORM HYPERGRAPHS 

In this section we shall introduce our main tool: a theorem which allows us to deduce 
structural results for sparse sum-free sets from stability results in the dense setting. Since 
we wish to apply our result to sum-free sets in various Abelian groups, we shall use the 
language of general 3-uniform (sequences of) hypergraphs H = (T-L n )nen, where |V("H n )| = n. 
Throughout the paper, the reader should think of H as encoding the Schur triples (that is, 
triples (x, y, z) with x + y = z) in an additive structure. 

We now define the stability property with which we shall be able to work. Let a G (0, 1) 
and let B = (i3 n ) n£ N, where B n is a family of subsets of V(H n ). We shall write \B n \ for the 
number of sets in B n , and set ||B„|| = max{|£>| : B G B n }. 

Definition 2.1. A sequence of hypergraphs "H = ("H n )neN i s s& id to be (a, £>)-stable if 
for every 7 > there exists > such that the following holds. If A C V(H n ) with 
\A\ >(a- (3)n, then either e{U n [A\) ^ 0e(H n ), or \A \ B\ < for some B G B n . 

Roughly speaking, a sequence of hypergraphs (H n ) is (a, £>)-stable if for every A C V(l-i n ) 
such that \A\ is almost as large as the extremal number for T-t n (i.e., the size of the largest 
independent set), the set A is either very close to an extremal set B G B n , or it contains 
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many (i.e., a positive fraction of all) edges of H n . Observe that classical stability results, 
such as that of Erdos and Simonovits [T3"| I3"B] . are typically of this form. 
We shall need two further technical conditions on H. Let 

A 2 ("Hn) = max I {e G H n : T C e} I , 

TCV(«„),|T|=2 1 1 J 1 

and note that if "H encodes Schur triples then A 2 ("H n ) ^ 3. Also define 

5{1-L n , B n ) := min min I je G "H n : |e fl 51 = 2 and t> G e) I , 

B&Bn v<=V(H n )\B 11 J 1 

and, as usual, write a{H n ) for the size of the largest independent set in % n . 
The following theorem is the key step in the proof of Theorem 11.11 



Theorem 2.2. Let % = (H n )neN be a sequence of 3 -uniform hypergraphs which is (a,B)- 
stable, with e(T-L n ) = Q(n 2 ) and A 2 (H n ) = 0(1). Suppose that \B n \ = n oi - l \ that a(1-L n ) ^ 
||£> n || ^ an, and that 8{% ni B n ) = Q(n). Then there exists a constant C = C("H, B) > such 
that if 

m ^ C^n log n. 

then almost every independent set in T-t n of size m is a subset of some B G B n . 

We shall prove Theorem 12.21 in Sections H] and |5l In Section EJ we shall use it to prove 
Theorem 11.11 



3. Independent sets in regular graphs with no small eigenvalues 

As a warm-up for the proof of Theorem 12. 2\ we shall prove a bound on the number of 
independent sets in regular graphs with no small eigenvalues, Theorem II. 3[ which improves 
a theorem of Alon and Rodl [3]. This result will be a key tool in our self-contained proof 
of Theorem 11.11 in the case q = 2, see Section [3 Moreover, many of the ideas from the 
proof of Theorem 11.31 will be used again in the proof of Theorem 12.21 We remark that the 
technique of enumerating independent sets in graphs used in this section was pioneered by 
Kleitman and Winston (23] and our proof of Theorem 11.31 below, requires little more than 
their original method. 

Given a graph Q on n vertices, and an integer m G [n], let I{Q) denote the number of 
independent sets in Q, and recall that m) denotes the number of independent sets of size 
m in Q. Alon pQ proved that if Q is a <i-regular graph on n vertices, then I(Q) ^ 2™/ 2+ °( n ) (as 
d — > oo), resolving a conjecture of Granville (see [I]), and suggested that the unique Q that 
maximizes I{Q) among all such (i.e., n- vertex rf-regular) graphs might be a disjoint union 
of copies of Kd t d- This conjecture was proven (using the entropy method) by Kahn [21] for 
bipartite graphs, and recently in full generality by Zhao (38]. 

Theorem 3.1 (Kahn |21j . Zhao [38]). Let Q be a d-regular graph on n vertices. Then 

1(G) ^ (2 d+1 - l) n ' 2 \ 
where equality holds if and only if Q is a disjoint union of copies of Kd t d- 
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Since —d is an eigenvalue of K d ^ dl any (i-regular graph Q containing a copy of K d d satisfies 
X(G) = —d. One might hope that a stronger bound on 1(G) holds for (i-regular graphs Q 
with \(G) > —d. Alon and Rodl [3] proved such a bound on I(G, m) for the slightly narrower 
class of (n, d, A)-graphs and used their result to give sharp bounds on Ramsey numbers. 

Theorem 3.2 (Alon and Rodl |3J). Let Q be an (n,d, X)-graph. Then 

Ttn . / emd 2 \ 2{n/d)Xogn [2\n/d> 
I(G,m) ^ 

for every m ^ 2(n/d) logn. 



4An log n J \ m 



If A = fl(d) as n — > oo, then Theorem 11.31 improves the above result in three ways: 
it provides a stronger bound for a wider range of values of m in a wider class of graphs. 
Theorem 11.31 is an immediate consequence of the following theorem, combined with the 
Alon-Chung lemma (Lemma 13 A\ below). 

Theorem 3.3. For every e, 5 > 0, there exists a constant C = C(e, 5) such that the following 
holds. Let Q be a d-regular graph on n vertices, and suppose that 2e(A) ^ for every 

A C V(G) with \ A\ ^ (a + 5)n. Then 



I(G,m) < 

for every m ^ Cn/d. 



[a + 28)n 
m 



The assumption in Theorem 13 . 3 1 that 2e(A) ^ e| A|d might seem somewhat strong; however, 
it follows from the Expander Mixing Lemma that it is satisfied by every (n, d, A) -graph with 
A/ (d+X) ^ a. For two sets S, T C V(G) , let e(S, T) denote the number of pairs (x, y) G SxT 
such that {x,y} G E(G). In particular, we have e(S,S) = 2e(S) for every S C V(G). The 
following result is proved^ in [2] . 

Lemma 3.4 (Alon-Chung [2J). Let Q be an n-vertex d-regular graph. Then for all A C V(G), 

2e(A)>-\A\ 2 + ^-\A\(n-\A\). 

n n 

We first deduce Theorem 11.31 from Theorem 13.31 and Lemma 13.41 

Proof of Theorem \1.3\ . We claim that if Q is an n-vertex <i-regular graph with X(G) ^ —A 
and a = X/(d + A), then 2e(A) ^ e\A\d for every A C V(G) with \A\ ^ (a + e)n. This 
implies that ^ satisfies the assumption of Theorem 13.31 (with S = e), and so the theorem 
follows. 

To prove the claim, suppose that A C V(G) satisfies \A\ ^ (a + £)n. By Lemma [3.41 

2e(A) ^ -■ \A\ 2 - - ■ \A\(n - \A\) = ^ \(d + X)\A\ - Xn 
n n ' n i 



3 Although the result is stated in [2] in a slightly different form, its proof there in fact implies Lemma [3T4l 
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Our lower bound on \A\ and the choice of a = X/(d + A) now give 

2e(A) ^ \A\ (d + X)(a + e) — X = e\A\(d + X) ^ e\A\d, 
as required. □ 



In the proof of Theorem 13.31 we shall use an algorithm which uniquely encodes every 
independent set I of size m in Q as a pair (S, I\S), where \S\ ^ 2n/ed ^ 2m/ Ce, and I\S 
is contained in some set A, with \A\ ^ (a + which depends only on S. At all times, 
it maintains a partition of V(Q) into sets S, X, and A (short for Selected, eXcluded, and 
Available), such that S <Z I <Z AU S. 

At each stage of the algorithm, we will need to order the vertices of A with respect to their 
degrees. For the sake of brevity and clarity of the presentation, let us make the following 
definition. 

Definition 3.5 (Max-degree order). Given a graph Q and a set A C V(Q), the max-degree 
order on A is the following linear order (t> 1? . . . ,V\m) on the elements of A: For every % G 
{1, . . . , \A\\, Vi is the maximum-degree vertex in the graph Q[A \ {t>i, . . . , i>j-i}]; we break 
ties by giving preference to vertices that come earlier in some predefined ordering of V(Q). 

We are now ready to describe the Basic Algorithm. 

The Basic Algorithm. Set A = V{Q) and S = X = 0. Now, while \A\ > (a + S)n, we 
repeat the following: 

(a) Let i be the minimal index (in the max-degree order on A) such that G /. 
(6) Move Vi from A to S. 

(c) Move v i, . . . , Vi-i from A to X (since they are not in I by the choice of i). 

(d) Move N(vi) from A to X (since I is independent and Vi G /). 

Finally, when \A\ ^ (a + 5)n, we output S (which is a subset of V(G)) and I \ S (which is 
a subset of A). 

We remark that, as well as in [23], algorithms similar to the one above have been considered 
before to bound the number of independent sets in graphs [31 [32] and hypergraphs [TJ [8]. 

Proof of Theorem \3.3\ . The theorem is an easy consequence of the following two statements: 



i(g, m ) ^jr( 

t=l \ 



n\ f(a, + S)n 
t ) [ m-t 



where t = — ; + 1, and 

ed 



n\ f(a + 5)n\ ^ l_f(a + 26)n\ 
t J \ m — t J ^ m\ m J 

if t ^ 2n/ed and m ^ Cn/d. We shall prove ([T|) using the Basic Algorithm; ([2]) follows from 
a straightforward calculation. 

To prove (pQ), let t G N be the number of elements of S at the end of the algorithm, 
and note that we have at most m choices for S. Now, crucially, we observe that A is 
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uniquely determined by S (given the original ordering of V(Q)); indeed, step (a) of the Basic 
Algorithm requires no knowledge of I, only of S and Q . Since \A\ ^ (a + S)n, it follows that 
we have at most (^_f] n ) choices for I \S. 

It will thus suffice to show that, given our assumptions on Q, the algorithm terminates in 
at most ^o = ^ + l^|^ steps. We shall show that A loses at least ed elements at each 
step of the algorithm (except perhaps the last), from which this bound follows immediately. 
Indeed, consider a step of the algorithm (not the last), in which a vertex Vi is moved to S, 
and set A' — A \ {v\, . . . , Since this is not the last step, we have \A'\ ^ (a + S)n, and 

so 2e(A') ^ e|A'|d, by our assumption on Q. Thus \N(vi) fl A'\ ^ ed, since Vi is the vertex 
of maximum degree in and hence A loses at least ed elements in this step, as claimed. 

To prove ([2]), we use the fact that, if t is replaced by t + 1, then the left-hand side is 
multiplied by 

n — t m — t 

t+ 1 (a + S)n-m + t + 1' ^ ' 

We claim that (j3J) is at most |- (y) 2 . To prove this, we consider two cases: if m ^ (a + S/2)n, 
then (jHJ) is at most 

n 2m 2m 
t Sn St 
while if m > (a + S/2)n then it is at most 

n m 2 /m\ 2 
1 ' T ^ S' \TJ ' 

since n ^ 2m/S, and our assumptions imply that a(G) < (a + 5)n, so we may assume that 
m < (a + S)n. Thus, for each t with t ^ t ^ ^2, 

/n\/(a + <5)n\ /(a + <5)n\ -A- 2 /m\2 / J\ 2 (2m\ 2 '° /( Q + «J) n 
\t)\ m-t J ^ V ™ / JLi 5 ^ r ' ^ V to! / V 6 / V m 

Since Q ^ (a) C (c) ^ or ever y a>6>c^0, &;!> (k/e) k for every /c G N, the function 
t !— >■ (fp) ° is increasing on the interval (0,m), and to ^ ~^ ^ f=£j this is at most 

f2em\ 2t ° f(a + S)n\ < fCee\^ m ( a + S \ m f(a + 2S)n\ < 1_ f(a + 2S)n\ 
\St ) \ m J ^ \ S J \a + 2SJ \ m ) ^ m \ m J 

as required. In the final inequality we used the fact that C is sufficiently large as a function 
of S and e, and that m is sufficiently large (as a function of S), since m ^ Cn/d. □ 

4. Algorithm argument 

In this section, we shall introduce a more powerful algorithm than that used in Section El 
We shall use this algorithm in the proof of Theorem 12 .21 to bound the number of independent 
sets which contain at least Sm elements of V(H n ) \ B for every B G B n . We shall show that 
when m 3> ^/n, then the number of such independent sets is exponentially small. The model 
example that the reader should keep in mind when reading this section is when TL n is the 
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hypergraph of Schur triples in an n-element Abelian group of Type 1(g), where q is some 
prime satisfying q = 2 (mod 3). 

Given a hypergraph % n , a family of sets B n , and 5 > 0, we define 

SFf(H n , B n , m) := | J G SF(H n , m) : |J \ B| ^ 5m for every B G i3 n |, 

where SF("H n ,m) denotes the collection of independent sets in H n of size m. The following 
theorem shows that there are few independent sets in T-i n (i.e., sum-free sets) of size m which 
are far from every set B G B n . 

Theorem 4.1. Let a > and let % = ("H n )neN be a sequence of 3 -uniform hypergraphs 
which is (a,B)-stable, has e(H n ) = Q(n 2 ) and A 2 ("H„) = 0(1). If \\B n \\ ^ an, then for 
every 5 > 0, there exists a C > such that the following holds. If m ^ C^fn and n is 
sufficiently large, then 

\SF^ (U n ,B n ,m)\ ^ (2-™ + 5 m \B n \)(^ B ^j 

for some e = e(H, 5) > 0. 

We shall describe an algorithm which encodes every independent set / in 1-i n and produces 
short output for every / G SF^(Hn,B n ,m). As before, our algorithm will maintain a 
partition of V(T-L n ) into sets S, X, and A (short for Selected, eXcluded, and Available), 
such that S C I C A U S. We shall also maintain a set T C S (for Temporary), and the 
corresponding graph Q T , i.e., the graph with vertex set V{T-L n ), and edge set 

E{Q T ) = |{«, v) C V(^t) : {«, w} G K n for some w G t|. 

We shall frequently consider the max-degree order (defined in Section [3]) on the vertices of 
the graph £/r[.A]. 

4.1. The Algorithm. The idea of the algorithm is quite simple: we apply the Basic Algo- 
rithm of Section [3] to the graph £?t[^4] as long as it is reasonably dense. If £?t[^4] becomes too 
sparse, then there are four possibilities: either we have arrived at a set A which has at most 
(a — 0)n elements, or a set A which is almost contained in some B G B n ; or if not, then we 
can use the (a, £>)-stability of % to find either a new set T for which Qt is dense (see Case 2 
below), or a set of linear size that contains very few elements of /. Therefore, after moving 
relatively few vertices of T-L n to S, our choice for / \ S is limited to a set A C V(Q) that is 
either small or almost contained in some B G B n . It follows that if I was far from every 
B G B n , then (in both cases) the number of ways to choose I \ S from A is very small. 

We begin by choosing some constants. Let 7 > and note that since % is (a, £>)-stable, 
there exists (3 > so that if \A\ > (a - (3)\V (H n ) \ and \ A\B\> l\V(H n )\ for every B G B n , 
then e("H n [yl]) ^ (3e(H n ). Let us choose (3 > sufficiently small so that e(% n ) ^ (3n 2 and 
^(Hn) ^ 1//3 for all sufficiently large n. Let C = C(f3) > be sufficiently large, and set 

Cn m n 

d = ^ — ^ — . 

m C C 

We are ready to describe the Main Algorithm; this is the key step in our proof of Theorem 12. 2 1 
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The Main Algorithm. We initiate the algorithm with T — S C /, a deterministically 
chosen subset of / of size d (the first d elements of I in our ordering of V(H n ), say), and 
with A = V(H n ) \ S and X = 0. Now, while \A\ > (a — (3)n and \A \ B\ > for every 
B G B n , we repeat the following steps: 

Case 1: If the average degree in Qt[A] is at least (3 4 d, then: 

(a) Let i be the minimal index in the max-degree order on V(C/t[/1]) such that V{ G I. 

(b) Move Vi from A to S. 

(c) Move v i, . . . , Vi-i from A to X (since they are not in I by the choice of i). 

(d) Move N(vi) from A to X (since / is independent and Vi £ /). 

Case 2: If the average degree of {?t[^4] is less than /3 4 d, then we find a new set T as follows. 
Since TL is (a, B)-stable, |A| > (a — (3)n, and |A\.B| > for every S e B n , then A contains 
at least /3e(% n ) edges of "H n . Set 

Z =\z^A: e(g z [A}) ^ /3 2 n}, 

where £7z[A] is the graph with vertex set A and edge set {{x,y} : {x, y, z} G £(H n )}. We 
call the elements of Z useful. Since e(T-L n ) ^ /3n 2 , we have e(% n [A]) ^ /3 2 n 2 , and so 

=3e(^,[A]) ^3/3 2 n 2 . 

Moreover, we have e(^ 2 [A]) ^ A("H n ) ^ A 2 ('H n )n ^ n//3 for every z G Thus, by the 

pigeonhole principle, it follows that |Z| ^ 2j3 3 n. 
Now we have two subcases: 

(a) If / contains fewer than d useful elements, then move these elements from A to S 
and move the other useful elements from A to X. 

(b) If I contains more than d useful elements, choose d of them u\, . . . , (the first d in 
our ordering, say) and move them from A to S. Moreover, set T = {ui, . . . , uA-- 

Finally, when \A\ ^ (a — (3)n, or \A \ B\ ^ 'jn for some B G B n , then we output S (which 
is a subset of V(Q)) and I \ S (which is a subset of A) and stop. 

We shall show that the Main Algorithm encodes at most (3 2 m elements of I in S, and that 
S determines A. Theorem 14. II then follows from some simple counting. 

4.2. Proof of Theorem 14.11 We begin by proving three straightforward claims about the 
Main Algorithm; these, together with some simple counting, will be enough to prove the 
theorem. The following statements all hold under the assumptions of Theorem 14.11 

Claim 1. The Main Algorithm passes through Case 1 at most 2n/((3 A d) times, and through 
Case 2 at most 1//3 5 times. 

Proof. We prove the second statement first. To do so, simply observe that each time we pass 
through Case 2(a), we move at least 2/3 3 n — d^ (3 3 n vertices from A to X, and each time we 
pass through Case 2(6), we obtain a graph Qt[A] with at least / S 2 nd/A 2 ('Hn) — 0(d 2 ) ^ 2(3 4 nd 
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edges. In the latter case, we must remove at least fi A nd edges from (?t[A| before we can return 
to Case 2. Since A 2 (TL n ) < 1/(3, it follows that A(C? r ) sC \T\/(3 = d/(3, since if xy G E{Q T ) 
then there exists zeT such that {x,y,z} G E(TL n ), and for each pair {x, there are at 
most A2("H n ) such y. Thus we must remove at least (3 5 n vertices from A before returning to 
Case 2, and hence the algorithm can pass through Case 2 at most 1//3 5 times before the set 
A shrinks to size an, as claimed. 

To prove the first statement, note that each time we pass through Case 1 on two successive 
steps of the algorithm, we remove at least (3 4 d vertices of A in the first of these. Indeed, 
since Qt[A] (for the second step) has average degree at least (3 A d, then by the definition of 
the max-degree order, the vertex we removed in the first step must have had forward degree 
at least (3 A d. By the argument above, there are at most 1//3 5 steps at which this fails to 
hold, and therefore the algorithm passes through Case 1 at most 

n 1 2n 



times, as claimed. □ 

The next claim is a simple consequence of Claim [1] and our choice of d. 

Claim 2. If C ^ 3//3 7 , then \S\ ^ (3 2 m at the end of the Main Algorithm. 

Proof. Each time the Main Algorithm passes through Case 1, \S\ increases by one; each time 
it passes through Case 2, \S\ increases by at most d. Thus, by Claim [1] and our choice of d, 

2n d m 2m 2m ~ 

if C ^ 3//3 7 , as claimed. □ 

We next make the key observation that the set S contains all the information we need to 
recover the final set A produced by the algorithm. 

Claim 3. The set A is uniquely determined by the set S of selected elements. 

Proof. This follows because all steps of the Main Algorithm are deterministic, and every 
element of I which we need to observe is placed in S. Indeed, in Case 1 we observe only that 
Vi G /, and that the elements v i, . . . , v j_i ^ /. Since v i G S and V\ . . . , G" S, this can be 
deduced from S. In Case 2, the set Z does not depend on /. If at most d — 1 elements of 
Z are in S, then we are in Case 2(a) and the remaining elements of Z are in X; otherwise, 
we are in Case 2(6) and the first d elements of S D Z (in the order on V(TL n )) form the set 
T. Thus, inductively, we see that at each stage of the algorithm, the set A is determined by 
the set S. □ 

After all this preparation, we are ready to prove Theorem 14.11 



Proof of Theorem \4-l\ Let TL = (H n ) ne N be an (a, £>)-stable sequence of 3-uniform hyper- 
graphs as in the statement of the theorem, let 5 > be arbitrary, and choose 7 = 7(01, 5) > 
to be sufficiently small. Since TL is (a, i3)-stable, there exists > such that if |/| ^ 
(a — 2[3)n, then either \I\B\ < for some B G B n , or I is not an independent set in TL n . 
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In particular, note that if 7 < 8/(a — 2(3), then either m ^ (a — 2(3) n or SF^("H n , B n , m) 
is empty. We choose such a (3 sufficiently small, set C = 3/(3 7 , and choose e = e((3) > to 
be sufficiently small. Finally, let n be sufficiently large. 

Applying the Main Algorithm to each independent set I G SF^\l-L n ,B n ,m), that is, to 
every independent set in T-L n such that 1 1\ — m and \I\B\ ^ 5m for every B G B n , we obtain 
(for each such /) a pair (A, S) such that S C I C A U S. There are two cases to deal with, 
corresponding to the two possibilities that can occur at the end of the Main Algorithm. We 
first show that if \A \ B\ ^ jn for some B G B n then a much stronger bound holds. 

Claim 4. There are at most 5 m \B n \ f" 6 ™") sets I G SF^ ('H n} B n} m) such that the Main 
Algorithm ends because \A\B\ ^ for some B G B n . 

Proof. We claim first that the number of such sets I is at most 

ft 2 "™ 1 / \ / \ / II YD 

n\ I \ \\B n 



BeB n t=0 r^Sm 



£ E £ ( t ) ( r _t) ( m _ r r ( 4 ) 



Indeed, let S and A be the selected and available sets at the end of the algorithm, set t = \S\, 
and recall that t ^ (3 2 m by Claim [2J We have at most m choices for 5* and, by Claim [31 
the set 5* determines the set A. Let B G B n be such that \A \ B\ ^ and recall that 
\I\B\ ^ 5m by our assumption on /. Thus we must choose the set B G B n , at least 5m — t 
elements of A \ B, and the remaining elements from B. 

Note that t ^ 5m/ 2 by our choice of (3 and so either m ^ (2 / ~f/5)n or the number of 
choices for / is zero. Since ||-B n || ^ an and 7 is small, it follows that the summand in (jlj) is 
maximized exactly when r = 5m. Now, using the inequalities (™) ^ (^) and 

»!.)<(^)'G> 

which holds for every a > b > c ^ 0, and since t ^ 5m/2, m ^ {2^/5)n ^ (a/2)n and 
||i3 n || ^ an, we can bound each summand in (j4|) from above by 

/en\ t f 2cyn\ <5m ~* / m \ <5m /||$ n | 
V t ) \ 5m ) \ an — m J \ m 

Since t ^ 5m/2 and 1 1— >■ (c/i)* is increasing on (0, c/e), this is at most 



/kV /2e 7 n 2mY m /||S n ||\ < /IV 
\27t / \ 5m any \ m J ^ \7 / 



- ^ — - ^V m/2 /||B n ||\ ^ 52m /||B n | 

a5 1 \ m 1 ^ \m 



if 7 < (a<5/4e) 2 • 5 4 / 5 . Since m 2 • <5 2m ^ 5 m , the claim follows. □ 
Finally, we deal with the case in which \A\ ^ (a — (3)n for some B G B n . 

Claim 5. There are at most 2" £m ( 11 ^ 11 ) sets I G SF { *\H m B n ,m) such that the Mam Algo- 
rithm ends because \A\ ^ (a — (3)n. 
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Proof. As in the previous claim, we have t = \S\ ^ (3 2 m, by Claim [21 and the set S determines 
the set A, by Claim El Thus, the number of choices for I is at most 

Now, recall that either m ^ (a — 2/3) n or SF^("H n , B n , m) is empty. Thus, estimating each 
summand in ([6]) as in the proof of Claim HJ we obtain 

n\ f(a — /3)n\ ^ ( en \ ( m \* f( a ~fi) n \ <- f &m\ * /(a — /3)n 
t)\ m-t ) ^ \TJ \(a - (3)n-m) \ m J ^ V m 

Now, using the inequality ^ (^) c ( a ), which is valid for all a > b > c ^ 0, and recalling 
that t ^ /3 2 m and that t h-» (c/t)* is increasing on (0, c/e), we get 

em\ ((oc — p)n\ ^ ( e\ (a — p\ ( ocn\ ^ ( e y -pm/a( an 



f3t) V m ) ^ \j8v ' « j \mj^ W ' V™ 

Since ||£? n || ^ an, the right-hand side is at most — ■ 2~ em ( l|i ^ 11 ) if ft > and £ = e(/3) > 
are sufficiently small, as required. □ 

Combining Claims H] and |5j we obtain Theorem 14.11 □ 

5. JANSON ARGUMENT 

In this section, we shall complete the proof of Theorem 12.21 by showing that, under certain 
conditions, almost all independent (i.e., sum-free) sets I of size m in T-L n either satisfy I C B 
for some B G B n , or 1 1 \ B\ ^ 5m for every B G B n . The key properties of % which we 
will use are that A 2 {T-L n ) = 0(1), and that 5(H n ,B n ) = Q(n); our key tool will be Janson's 
inequality. An argument similar to that presented in this section was used in P to study 
sum-free sets in random subsets of Abelian groups. 

Given a hypergraph T-L n , a family of sets B n and 5 > 0, we define 

SFf(U nj B nj m) := { / G SF(H n ,m) : \I\B\ < 8m for some B G 

The following proposition shows that, if 5 > is sufficiently small, then almost all indepen- 
dent sets in SF^\jl n , B n , m) are contained in some B G B n . We write 2 B to denote the 
power set of B, i.e., the family of all subsets of B. 

Proposition 5.1. Let a > 0, letTi = (H n ) ne N be a sequence of 3 -uniform hypergraphs with 
^(Hn) — 0{1), and let B = (B n ) n£ n be a family of sets with \\B n \\ ^ an. For every (3 > 0, 
there exists constants 5 > and Cq > such that the following holds. If 5{fH n , B n ) ^ fin 
and C ^ Cq, then 

SFf(U n ,B n ,m)\ |J 2 B \ <: n- c \B n niBn 

BeB n 



m 



for every m ^ C \Jn log n . 



14 



NOGA ALON, JOZSEF BALOGH, ROBERT MORRIS, AND WOJCIECH SAMOTIJ 



Note that 5 and C in the statement of Proposition 15 . II may depend on B, a and (3. We 
begin by recalling the Janson inequalities, and some basic facts about the hypergeometric 
distribution. 

5.1. The hypergeometric distribution. The following well-known inequality (see [20| 
page 35], for example) allows us to deduce bounds in the hypergeometric distribution from 
results on product measure. For completeness we give a proof. 

Lemma 5.2 (Pittel's inequality). Let m, n e N, and set p = m/n. For any property Q on 
[n] we have 

P(<2 holds for a random m-setj ^ 3y/m ■ P(Q holds for a random p-subset of [n]J. 
Moreover, if Q is monotone decreasing and m ^ n — 1, then 

P(Q holds for a random m-setj ^ C • P(Q holds for a random p-subset of [n]) 
for some absolute constant C > 0. 

Proof. For the first part, simply note that a random p-subset of [n] has size m = pn with 
probability at least l/(?>yjm). If Q is monotone decreasing, say, then we apply the 'Local 
LYM inequality' to Q m , the set of m-sets in Q, and deduce that 

P(<2 holds for a random k-set) ^ P(<2 holds for a random m-set) 

for every k ^ m. It is well-known that the median of the binomial distribution lies between 
\_pn\ and \pn\ , and if m ^ n — 1 then it is easy to see that Bin(n,p) = \pn\ has probability 
at most (1 - l/n) n - 1 1/e as n — > oo. Thus, if m ^ n — 1 and n is sufficiently large, then 
a random p-subset of [n] has size at most m = pn with probability at least 1/2 — 1/e + o(l) 
as n — > oo, and the result follows. □ 

The following result is an easy corollary of Janson's inequality (see |H[20]), combined with 
Pittel's inequality. 

Lemma 5.3 (Hypergeometric Janson Inequality). Suppose that {Ui}i e i is a family of subsets 
of an n-element set X and let m G {0, . . . ,n} . Let 

pi = J2(m/ny Ui] and A = ^(m/n) 1 ^^ 1 , 

where the second sum is over ordered pairs such that i ^ j and Ui R Uj ^ 0. Let R be 
a uniformly chosen random m-subset of X . Then 

F(Ui <£R for alii el) ^ C ■ max je"^ 2 , e-^ /(2A) } , 

where C > is the constant in Pittel's inequality. 
We now return to the proof of Proposition 15.11 
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5.2. Proof of Proposition 15.11 We begin by partitioning SF< (% n , B n , m) according to 
the set B G B n such that \I\B\ ^ 5m, and also according to the set S = I\ B. (Technically 
there could be more than one such set B, so in fact this might be a cover, rather than a 
partition.) Set 

I(B,S) := {/ G SF(H n ,m) : I\B = s} . 
We shall prove the following lemma. 

Lemma 5.4. LefH = (H n )„ e N be a sequence ofZ-uniform hypergraphs with A 2 ("H n ) = 0(1). 
For every sufficiently small /3 > 0, there exists a Cq > such that the following holds. Let 
C ^ Cq, let B C [n] wrf/i 5(7i n , B) ^ /3n ; and Zet 5 C [n] \ B. Then, writing k = \S\, 

mS) « (\B\-^ + ^)(f_} k ) 

for every m ^ C\Jn logn. 

In order to prove Lemma I5.4[ we shall apply the following lemma to the Cayley graph 
of S, restricted to B. The lemma is a straightforward consequence of the Hypergeometric 
Janson's inequality. 

Lemma 5.5. For every (3 > 0, there exists a constant C > such that the following holds. 
Let Q be a graph on n vertices with maximum degree at most d. If 

e{Q) > A(3dn, 

then for every C ^ Cq, 
for every m ^ C\Jn log n . 

Proof. Let {Ui}i £ j be the collection of pairs of vertices which span an edge of Q, so C/j ^ R 
for alH G / if and only if R is an independent set in Q. It is easy to see that, letting « and 
A to be the quantities defined in the statement of Lemma 15.3} 

Thus, by our bounds on e{Q) and m, and assuming C ^ 4/(3, 

2 

Lt Tfl 

u ^ ACdhgn and 4- ^ e(<?) — > 4/3m. 

A an 

By the Hypergeometric Janson Inequality, 

-ft/2 - M 2 /(2A)\ 



I(g,m)/(^j <: C-maxje-^ 2 ^- 

^ C-max jn~ 2Cd ,e- 2/3m } ^ max |n" 



Cd -/3m 

1 t> 



as required. □ 
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Recall that, given 7-L n , the Cayley graph Q s of S is defined to be the graph with vertex 
set V(H n ) and edge set 

E(Q S ) = ({u,v} C V(H n ) : {u, v,w} G T-L n for some w G sj. 

In order to apply Lemma [531 we shall need the following easy property of the Cayley graph. 

Observation 5.6. A(Q S ) < \S\A 2 (n n )- 

We can now easily deduce Lemma 15.41 from Lemma 15.51 and Observation 15.61 

Proof of Lemma \5.4\ If I is an independent set in H n containing S, then / \ S is an inde- 
pendent set in Qs, so 

I(B,S) ^ l(Q s [B],m-k), 
where k = \S\. Choose (3 > sufficiently small so that A 2 ("H„) ^ 1/(2/3), recall that 
5(H n ,B) ^ 0n, note that d = A(Q S ) ^ \S\A 2 (U n ) ^ \S\/(2p), and observe that therefore 



Thus, by Lemma |5.5[ if (3 < 1/10 then d ^ 5151 = 5k and 

i(g s[ B], m -k) < (lBr 5C ' ! + ^ ,m )(J! l fc ). 

for every m ^ CVnlogn, as required. □ 

Finally, let us deduce Proposition 15.11 from Lemma 15.41 

Proof of Proposition \5.1[ Summing over all sets B G B n and subsets S C [n]\5, and applying 
Lemma 15.41 we have 

6m 

SFf(H n ,B n ,m)\ |J 2 B \ < E J ( s > 5 ) 

Bet3„ B£B n k=l SC[n]\B:\S\=k 

6rn 



fc=l 

for every m ^ Cy/n logn, since 5(T-L n ,B n ) ^ /3n and A 2 (H n ) = 0(1) together imply that 
|-B| = O(n) for every B G $ n . We consider three cases. 

Case 1: If n~ A{ - c+k) > e~^ m , then 



k J \ J \m — k J \ m 

■"»(£!) « ©ft 1 )- 

Case 2: If n - i ( c + k ) ^ e -/3 3m anc i m ^ an/2, then by (JSJ) we have 

l^nll \ ^ ( m \ f\\Bn\W ^ {2m\ k (\\B ri 



m — k J ^ \ an — m) \ m ) ^ \ an / ^ m 
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n )( n -Hc +k) + e -^ m \(\m\ ^ 2 . e _,a m (2em\ k (\\B n \\\ ^ e ^ m/2 (\\B n \ 
k/\ ) \m — k j V ak I \ m I \ m 



since ||£> n || ^ ocn. Thus, using the bound (£) ^ (^) , we have 

( 

if 5 = 5(a, (3) > is sufficiently small, since k ^ 5m. 

Case 3: If ||i3„|| _4 ^ c ' +A: - ) ^ e -^ 3m an d m ^ an/2, then we again use the (trivial) bound 
O < C) ("t'^ to obtain 

n\ f n _ 4(c+k) + e -p m \f\\Bn\\ \ < 2e -^ m fA 2 n\B n \\\ < e ^s m/2 f\\B n \ 



kj \ J \m — kj \kj \ m J \ m 

if 5 = 5{a,(3) > is sufficiently small, since g) ^ ( 2m / a ) < (|§) 5m ^ e ~P 3 ^ for ^ 5m. 
Since e - ^ 3 " 1 / 2 <C r7 _2C for m ^ Cv'niogn, the claimed bound follows. □ 

We finish this section by observing that Theorem 14.11 and Proposition 15.11 together imply 
Theorem O 

Proof of Theorem \2.£\ Let H = ('H n ) ne N be a sequence of 3-uniform hypergraphs which is 
(a, £>)-stable, where B = (B n ) n£ fq is a family of sets, and a > 0. Suppose that a(/H n ) ^ 
\\B n \\ ^ an, and that there exists (3 > such that e(H n ) ^ /3tt, 2 , A 2 (Hn) ^ 1//3, \B n \ ^ n 1///3 
and 5(H n ,B n ) ^ /3n for every n G N. Let 5 = 5(f3) > be sufficiently small, and let 
C = C{(3, 5) > be sufficiently large. We claim that if 

^ CV77 fog 77. 

then almost every independent set in % n of size m is a subset of some B £ B n . 

Indeed, by Theorem 14.1} the number of independent sets / in 1-L n of size m for which 
| / \ B | ^ 5m for every B e B n is at most 



for some £ > and by Proposition 15.11 the number of such sets for which 1 ^ \I\ B\ ^ 5m 
for some B £ B n is at most 

n-°\B n \( m 

\ 777 

Since \B n \ ^ n 1 /^, C > 1/(3, and a(ft n ) ^ ||B n ||, the result follows. □ 

6. Abelian groups of Type I 

In this section, we shall use Theorem 12.21 to prove Theorem 11.11 for all q > 2. We remark 
that the proof below can also be adapted to cover the case q = 2; however, since we shall 
give a different proof of the case q = 2 in Section [7J we leave the details to the reader. (If 
T-L n denotes the hypergraph that encodes Schur triples in a group G of even order n and B n 
denotes the collection of maximum-size sum-free subsets of G, then it is not always true that 
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Q(7i n ,B n ) = Q(n). This problem can be easily overcome by considering triples of the form 
(x,x,2x), cf. the proof of the 1-statement in [SI Theorem 1.2].) 

In order to prove that our hypergraph is (a, B)-stable, we shall use the following re- 
sult (see [SI Corollary 2.8]), which follows immediately by combining results of Green and 
Ruzsa [18] and Lev, Luczak, and Schoen [26]. Let SFo(G) denote the collection of maximal- 
size sum-free subsets of G and recall that each B G SF (G) has size /jl(G)\G\. 

Proposition 6.1. Let G be a finite Abelian group of Type I(q), where q = 2 (mod 3) and 
let < 7 < 7(g) and < (3 < /3q(7, q) be sufficiently small. Let A C G, and suppose that 

\A\ > (p(G) - P)\G\. 

Then one of the following holds: 

(a) \A\B\^ j\G\ for some B G SF (G). 

(b) A contains at least (3\G\ 2 Schur triples. 

We shall also use the following classification of extremal sum-free sets for Type I groups. 

Theorem 6.2 (Diananda and Yap [H]). Let G be a finite Abelian group of Type I(q), where 
q = 2 (mod 3). Then every B G SF (G) is a union of cosets of some subgroup H of G of 
index q, B/H is an arithmetic progression in G/H , and B U (B + B) = G. 

In other words, for every B G SF (G), there exists a homomorphism ip: G — > Z g such that 
B = ip~ l {{k + 1, . . . , 2k + 1}), where q = 3k + 2. 

Combining Theorem 16.21 with Kronecker's Decomposition Theorem, we easily obtain the 
following well-known corollary. 

Corollary 6.3. Let G be an arbitrary group of Type I. Then |SFo(G)| ^ 

It is now straightforward to deduce Theorem 11.11 from Theorem 12.21 Proposition 16.11 and 
Corollary 16.31 

Proof of Theorem \l.l\ for q ^ 2. Let q = 2 (mod 3) be an odd prime, let C = C(q) be 
sufficiently large, and let G n be an Abelian group of Type 1(g), with \G n \ = n. We shall 
show that if m ^ C(q)y/n\ogn, then almost every sum- free set of size m in G n is contained 
in a member of SF (G). 

We begin by choosing an infinite set X C N such that, for every n G X, q is the smallest 
prime divisor of n with q = 2 (mod 3). For each n G X, let G n be an Abelian group of 
Type 1(g), with \G n \ = n, and define H = {T-L n )nex to be the sequence of hypergraphs on 
vertex set V(H n ) = G n which encodes Schur triples. To be precise, let V(7i n ) = G n , let 
{x, y, z} G (^ n ) be an edge of T-L n whenever x + y — z, and observe that every sum-free 
subset of G n is an independent set in "H n EI Let B n = SF (G n ), the collection of maximum 
size sum-free subsets of G n , and recall that \B n \ ^ n, by Corollary 16.31 

We claim that ~H and B satisfy the conditions of Theorem 12.21 Indeed, 1-L n is 3-uniform, 
has 0(n 2 ) edges, and satisfies A 2 ('H n ) — 3. Setting a = /i(G), we have a(T-L n ) = \\B n \\ = an 

4 But not vice-versa, since T-L n does not contain the Schur triples in G n of the form (x,x,2x). Thus, by 
bounding I (Urn m ) we are m f &c t proving a statement which is slightly stronger than Theorem ll.il 
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and \B n \ ^ n, as observed above. Moreover, the statement that T-L is (a, £>)-stable is exactly 
Proposition 16.11 Thus it will suffice to show that 5(H n ,B n ) = Q(n). 



Claim. For each B £ SFq(G) and every x G G \ B , 



n 1 
^ Yq ~ 2" 



Proof of claim. Let S G SF (G) and let x G G\B. By Theorem I6.2[ there exists a subgroup 
if of G of index g such that B is a union of cosets of H and B U (B + B) = G. It follows 
that x = y + £ for some y,z G 5, and that y + h,z — h £ B for every h £ H. Thus, 

{y + /i, z - fc} G C(ar) := | {y, 2} G Q J : a; = y + z J 

whenever h £ H and y + h ^ z — h. Moreover, since |G| is odd, there is at most one h £ H 
such that 2/i = z — y, so |C(x)| ^ — l)/2 = n/2g — 1/2, as required. □ 

Thus the pair ("H, B) satisfies the conditions of Theorem 12.21 and hence if C(q) is sufficiently 
large and m ^ C(q)y/nlogn, then almost every sum-free set of size m in G n is contained in 
some B G B n , as required. 

Finally, let us deduce that if G is an Abelian group of Type l(q), and m ^ C(q)y/n\ogn, 
then 

|SF(G,m)| = - ■ (#{elements of G of order q\ + o(l)) ( . 

2 \ m J 

Indeed, it suffices to observe that |SF (G)| = # {elements of G of order g}/2, by Theo- 
rem E21 and that each pair B, B' £ SFo(G) intersect in at most (1 — l/g)//(G)|G| elements. 
The result now follows from some easy counting. □ 

7. Abelian groups of even order 

In this section, we shall prove the following theorem, which implies Theorem 11.11 in the 
case q = 2. We shall use Theorem 11.31 and some ideas from Section EJ but otherwise this 
section is self-contained. In particular, we shall not use Proposition 16.11 and thus we give a 
new proof of the main theorem of and 



Theorem 7.1. If G is an Abelian group of order n, then 

'n/2 



|SF(G,m)| = {^{elements of G of order 2} + o(l)) 



m 



for every m ^ log n. 



We remark that we shall prove the theorem for all finite Abelian groups, not just those of 
even order. We begin by partitioning the collection of sum-free sets into two pieces. Given 
an Abelian group G, let 

SFf (G,m) := {/ G SF(G,m) : \I n H\ ^ 5m for some H < G with [G : H] = 2 j, 
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and 

SF^(G,m) := {/ G SF(G,m) : |/n H\ > 5m for every H < G with [G : F] = 2}. 

Note that if |G| is odd then SF^(G, m) is empty. We shall prove the following proposition 
using the method of Section [5J 

Proposition 7.2. Let G be an Abelian group of order n, and let 5 > be sufficiently small. 
Then 

\SFf (G,m)| ^ (#{e/ements o/G of order 2} + o(l)) - ' 



/// 



/or every m ^ 4a/tz logn. 

For sets in SF> (G, m), i.e., far from any H ^ G of index 2, we shall prove the following 
stronger bound using Theorem 11.31 

Proposition 7.3. Let G be an Abelian group of order n, and let 5 > 0. If e — e(5) > is 
sufficiently small and C = C(5) is sufficiently large, then 



|SF^(G,m)| ^ 2" 

' m 

/or every m ^ Gi/n and every sufficiently large n G N. 

We begin by proving Proposition 17.21 In this section, we shall use a slightly different 
notion of Cayley graph than that used earlier. Given S C G, define to be the graph with 
vertex set G\S and edge set {xy: x — y G S 1 }, and note that if I is a sum- free set in G with 
SCI, then i \ 5 is an independent set in Q* s . 

Proof of Proposition pO[ Let G be an Abelian group of even order n, let if be a subgroup 
of G of index 2, and let S Q H satisfy \S\ — k ^ 5m. Set 7 = 1/65. We claim that for every 
m ^ A^n logn, there are at most 

n -4k + e -Tm\( n/2 \ 

J \m — kj 

sum-free subsets I of G of order m with I C\ H = S . 

Observe first that the graph Qg[G\H] is rf-regular, where d = \SU(—S)\ G [k, 2k}. Indeed, 
for each x G G \ H, let 

iV(x) = {y eG\H : i-i/6Sor!/-xG5}. 

Since S C if, it follows that x — 5 and x + S 1 are in G \ H, and hence |iV(x)| = \S U (— S")!, 
as claimed. Since \S\ = k, we have k ^ d ^ 2k. 

Now, by the Hypergeometric Janson Inequality, Lemma 15.31 there are at most 

G-max{e- fcm2 / 4 ",e- m / 64 }^/ 2 ^ < ( n ~* + e ~^) (jf^ X 
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independent sets of size m — k in Qg[G \ H]. This follows because k ^ 5m, so 

/ kn\ f{m — k) 2 \ km 2 , , f 2k\ n f (m — k)' i \ 8k 2 m 3 



and m ^ 4\/n logn. Since each sum-free subset ICG induces an independent set in 
g* s [G\H], then © follows. 

Finally, summing (0) over subgroups if and sets S", we obtain 

5m 



|SF?(G,m)| < [G : //] = 2} £ ("Vn"« + e"^) ( ^ ) 

< ^{elements of G of order 2} + O (J^J J 



'n/2 
v m 

for every m ^ 4^/n logn. To see the last inequality, observe that the number of subgroups 
H of index 2 in G is exactly the number of elements of G of order 2 and consider three 
cases as in the proof of Proposition 15.11 Indeed, if n~ 4fc ^ e -7 " 1 or m ^ n/4, then each 
summand in © is at most (n" 2fc + e" 7m / 2 ) f^ 2 ) by the trivial bound (™ /2 k ) < (™) ( n ^) . But 
if n~ 4fe < e _7m and m ^ n/4, then by (JSJ), 



n/2 \ ^ /eny/ 2m V fn/2\ ^ /4em\Vn/2\ ^ e o(Vfo.) (V 2 
A; y \m — A;/ ^ \ A; / \n — 2m / \m J ^ \ k J \ m J ^ \m 
since ^ 5m. Thus, if 5 > is chosen small enough, then each summand in (JS} is at most 
e ~ W2 ("m ) > as squired. □ 

We next turn to the proof of Proposition 17.31 We shall divide into two cases: either the 
smallest eigenvalue A(I) of I (see below) is at most (5 — 1)\I\, in which case we shall use 
some basic facts about characters of finite Abelian groups to show that there are few such 
sets; or A (I) is larger, in which case we shall find a small subset SCI such that Q* s is a 
d-regular graph with smallest eigenvalue satisfying A > (5/4 — l)d, and apply Theorem 11.31 
We begin with the following key definition. 

Definition 7.4 (The smallest eigenvalue of S). Given a finite Abelian group G, and a subset 
£ S C G, let 

X(S) := min |Re(A) : A(S)v = \v for some v ^ 0}, 

where Re(A) is the real part of the complex number A, A(S) is the adjacency matrix of the 
directed Cayley graph on G, i.e., the (0, l)-matrix with A(x,y) — 1 iff y — x 6 S, and is 
the zero vector. 

Next, we recall some simple properties of characters of finite Abelian groups. 
7.1. Characters of finite Abelian groups. 

Definition 7.5. A character of a group G is a homomorphism from G into the multiplicative 
group of non-zero complex numbers, i.e., a function \ : G — > C* such that x(a+6) = x( a )x(^) 
for all a,b £ G. 
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A character x is called trivial if xi x ) = 1 f° r & U £ G G; we will denote the trivial character 
by xt- The set of all characters of G is denoted by G. The following statement establishes 
a relation between the smallest eigenvalue of the matrix A(S) and the characters of G. 

Lemma 7.6. For every (jL S C G, 



min | Re fj^x( s )l 



X(S) = min V y( S ) : x^b 



We shall use the following facts about finite Abelian groups in the proof of Lemma 17.61 

Fact 1. If G is a finite Abelian group of order n, then all its characters take values in the 
set 

U n := {(£ n ) k : ke{0,...,n-l}}, where £ n = e 2m/n , 
of nth roots of unity. Moreover, if x G G then range(x) = £4 for some k = k(x)- 
Fact 2. G is an orthogonal basis of the vector space C G . 

Proof of Lemma \ 1. b\ Let us start by breaking up the adjacency matrix A(S) into \S\ pieces 
as follows: 

^-^ ( 1 if y — x = s 

M s ) = 2_^A S , where A s (x,y) = 



otherwise. 



Thus, for every s, x G G and x G G, 

(A s x)(x) = x(x + s) = x(x)x(s) = (x(s)x)(aO, 
and so every x G G is an eigenvector of each A s with x{ s ) being the corresponding eigen- 
value. Hence, every x G G is an eigenvector of A(S), with eigenvalue ^ se5 x( s )- Since the 
characters of G form an orthogonal basis of C G , it follows that the set of eigenvalues of A(S) 
is exactly 

The inequality X(S) ^ — \S\ follows since |x( s )| — 1 f° r every s G S and x G G. □ 

Let us note for future reference the following fact from the proof above. 

Lemma 7.7. For every G" S C G, i/ie characters of G form a basis of eigenvectors of the 
matrix A(S). 

7.2. Sum-free sets with small smallest eigenvalue. Using the properties described 
above, we shall prove the following lemma. 

Lemma 7.8. Let 5 > be sufficiently small and let n G N be sufficiently large. Then, there 
exist constants e = e(S) > and C = C(5) such that, for every m ^ Cy/n, 

V2" 



{/G SF^(G,m) : A(/) < 



^ 2 



m 
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Proof. For each character x of G and each A C G, define X(A,x) = Re (XLeA • We 
shall bound the number of I G SF^(G, m) such that X(I,x) ^ — l)|i| by 

V2 N 



The desired bound will follow since X(I) = min x6( j X(I, x) and there are at most \G\ char- 
acters of G. We split into two cases, depending on the number of different values taken 
by X- 



Case 1. |range(x)| = 2. 

Since x is a group homomorphism, it corresponds to a subgroup H of G of index 2, namely, 
if = Since \I C\ H\ ^ <5n for every such H, we have 

X(I, x) = Re f g x (x) j = \lnH\-\I\H\ ^ (25 - l) |J] 
and hence in this case, there are no i G SF> (G, m) such that X(I, x) ^ (8 ~ l)|f |- 



Case 2. |range(x)| ^ 3. 



Let = |range(x)| and recall (from Fact [T]) that range(x) = £4, where £4 is the multi- 
plicative group of kth roots of unity Observe that — n/k for every £ G £4, and 
consider, for each ( on the complex unit circle S 1 , the open arc Cf of length 7r/3 centred at 
( on 5 1 . Set : = x _1 (C^), and note that |Cf fl Uk\ ^ fc/3 for every ( G S 1 , and hence 
| iff | ^ n/3. Note also that, even though there are infinitely many C^, there are at most 2k 
different sets K^. 

Let c > and suppose first that there exists ( G S 1 such that \Kt D i| ^ (1 — c)\I\. The 



number of such sets i is at most 



n 



\Kt 



m 



— c)m J \ cm J \ 3 



n/2 
m 



where c'fc) -)• as c -> 0. 



So suppose that jiff ni| ^ (1 — c)|i| for every ( G S 1 . We claim that, if 5 > is sufficiently 



small, then 



|A(/,x)| = ^ (l-c + ccos(7r/6)).|J| < (l-5)|i| 



xei 



(9) 



To see this let v = ^2xeiX( x )i no ^ e ^at if v = then we are done, and otherwise observe 
that, by our assumption, xi x ) can ue within the open arc of length 7r/3 centred in direction 
v for at most (1 — c)\I\ elements x G i. Since each of the others contribute at most cos(7r/6) 
in the direction of v , follows. This is a contradiction, so the proof is now complete. □ 
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7.3. Sum-free sets with large smallest eigenvalue. We shall prove the following state- 
ment using Theorem II .31 Together with Lemma T7.8I it will easily imply Proposition 17. 3^ and 
hence Theorem 17. II 

Lemma 7.9. For every finite Abelian group G and every 5 > 0, there exist e = e(5) > 
and C = C(5) > such that 

h e SF^(G,m) : A(I) > (tf-l)|J|} 

for every m ^ C\fn. 

The idea of the proof is as follows: we choose a set S C J of size em and observe that, 
since / is sum-free, I \ S is an independent set in Q* Sl the Cayley graph of S. The key point 
is that, for some such S, our bound on A(J) implies the existence of a non-trivial bound on 
X(Qg), the smallest eigenvalue of the adjacency matrix of the Cayley graph of S. Combined 
with Theorem 11.31 this implies that there are only very few choices for I \S, and hence for 
/ itself. 

The first step is the following lemma, which shows that our bound on X(I) allows us to 
find a small set S such that A(S')/|S'| is also bounded away from minus one. 

Lemma 7.10. If I & SF> (G, m) satisfies X(I) ^ (5 — then there exists a set S C / 

of size em such that 

X(S) > (10) 

Proof. Recall the definition of X(A, x) from the proof of Lemma I7T81 Since X(I) ^ (5 — 1)|/|, 
it follows from Lemma EH] that A(J, x) ^ (5 — 1)|/| for every \ £ G. Choose a subset SCI 
of size em uniformly at random; we claim that X(S,x) is tightly concentrated around the 
mean, i.e., around eX(I, x)- Indeed, by Chernoff's inequality, we have 

p(A(S,xK(5/2-l>|/|) ^ e-^ m \ 

where the implicit constant depends on e and 5. There are exactly n characters in G, and 
so, by the union bound, the probability that S does not satisfy (flOj) is at most 1/2. Thus 
there exists a set S as claimed. □ 

Next, we show that this bound on X(S) implies a similar bound on X(Qg), the smallest 
eigenvalue of the adjacency matrix of the Cayley graph of S. Recall that the adjacency 
matrix of Q* s is A[S U (— S)), and hence 

x{g* s ) = x(su(-s)). 

We shall use the following lemma, which bounds X{Gs) m terms of X(S). 
Lemma 7.11. Let £ S C G and 5 > 0. If X(S) ^ (5-l)\S\, then 

X(g* s ) 2 (j-l)\SU(-S)\. 
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Proof of Lemma \7.11\ By Lemma [777^ the characters of G are a basis of eigenvectors of both 
A(S) and A((-S) \ S). Thus, by Lemma EH 

\(g* s ) = \(su(-s)) = x(s) + x((-s)\s) 

> (5-l)\S\-\(-S)\S\ > ( 6 --l)\Su(-S)l 



2 

as required. The last inequality follows from the fact that \(—S) \ S\ ^ |(— S)\ = \S\. □ 

We can now complete the proof of Lemma 17.91 

Proof of Lemma 7.9. Let / G SF^\G,m) and suppose that ^ (5 — By Lem- 

mas EH]] and EH1 there exists a set S C I with 15*1= em, such that 

KQs) > (j-i)\su(-s)\. 

Since / is sum-free, I\S is an independent set in Q* s . We claim that Q* s satisfies the conditions 
of Theorem II .31 Indeed, Q* s is a dg-regular graph on n vertices, where ds = \S U (— S)\, and 

1X1 v ' em \SU {S)\ 

since m ^ Cy/n. Note also that 

|A(0S)| < 1 ~ (S/A) < 1 * 



Hence, by Theorem 11.31 



and so 



ds + \HG*s)\ 2 -(5/4) ^ 2 16 
r,„. / x x /(1/2-5/20W 



{-s F?(G , m) :^).( 1 - W i}H(; m )( (1 f_r")^-*"(l 2 ) 

if e = > is sufficiently small. This proves the lemma. □ 
Finally, note that Lemmas 17.81 and 17.91 imply Proposition 17.31 

7.4. Proof of Theorem 17.11 First, observe that if \G\ is even, then the claimed lower 
bound on |SF(Cr, m)\ is a straightforward consequence of the fact that, by Theorem 16.21 
|SFo(G)| = # {elements of G of order 2} and that each pair of distinct B,B' e SF (G) 
intersects in \G\/4 elements. If \G\ is odd, then Theorem 17.11 only gives an upper bound on 
|SF(G,m)|. 

For the upper bound, observe that by Propositions 17.21 and 17.31 we have 
|SF(G,m)| ^ |SFj } {G,m)\ + \SF f{G,m)\ 

< (#{elements of G of order 2} + o(l)) ) + 2~ Em ( n ^ 2 

\ m ) \ m 
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for every m ^ 4^/n\ogn, as required. 
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